Hierarchical Partitioning of Metazoan Protein Conservation Profiles Provides New Functional Insights
نویسندگان
چکیده
The availability of many complete, annotated proteomes enables the systematic study of the relationships between protein conservation and functionality. We explore this question based solely on the presence or absence of protein homologues (a.k.a. conservation profiles). We study 18 metazoans, from two distinct points of view: the human's and the fly's. Using the GOrilla gene ontology (GO) analysis tool, we explore functional enrichment of the "universal proteins", those with homologues in all 17 other species, and of the "non-universal proteins". A large number of GO terms are strongly enriched in both human and fly universal proteins. Most of these functions are known to be essential. A smaller number of GO terms, exhibiting markedly different properties, are enriched in both human and fly non-universal proteins. We further explore the non-universal proteins, whose conservation profiles are consistent with the "tree of life" (TOL consistent), as well as the TOL inconsistent proteins. Finally, we applied Quantum Clustering to the conservation profiles of the TOL consistent proteins. Each cluster is strongly associated with one or a small number of specific monophyletic clades in the tree of life. The proteins in many of these clusters exhibit strong functional enrichment associated with the "life style" of the related clades. Most previous approaches for studying function and conservation are "bottom up", studying protein families one by one, and separately assessing the conservation of each. By way of contrast, our approach is "top down". We globally partition the set of all proteins hierarchically, as described above, and then identify protein families enriched within different subdivisions. While supporting previous findings, our approach also provides a tool for discovering novel relations between protein conservation profiles, functionality, and evolutionary history as represented by the tree of life.
منابع مشابه
PathCluster: a framework for gene set-based hierarchical clustering
MOTIVATION Gene clustering and gene set-based functional analysis are widely used for the analysis of expression profiles. The development of a comprehensive method jointly combining the two methods would allow for greater biological insights. RESULTS We developed a software package, PathCluster for gene set-based clustering via an agglomerative hierarchical clustering algorithm. The distance...
متن کاملInvestigation of Genetic Variations among Crested Wheatgrass Species Base of Agronomical Traits and Total Leaf Protein
Genetic variations within the species of Agropyron desertorum and 2 varieties of A. cristatum subsp. pectinatum var. imbricatum and A. cristatum subsp. pectinatum var. pectinatum were studied using morphological traits and total protein profiles (with sodium dodecylsulphate polyacrylamide gel electrophoresis). An experiment was conducted in Research Institute of Forests and Rangelands (2012-201...
متن کاملOnline Estimation of Elbow Joint Angle Using Upper Arm Acceleration: A Movement Partitioning Approach
Estimating the elbow angle using shoulder data is very important and valuable in Functional Electrical Stimulation (FES) systems which can be useful in assisting C5/C6 SCI patients. Much research has been conducted based on the elbow-shoulder synergies.The aim of this study was the online estimation of elbow flexion/extension angle from the upper arm acceleration signals during ADLs. For this, ...
متن کاملDIAGNOSIS OF BREAST LESIONS USING THE LOCAL CHAN-VESE MODEL, HIERARCHICAL FUZZY PARTITIONING AND FUZZY DECISION TREE INDUCTION
Breast cancer is one of the leading causes of death among women. Mammography remains today the best technology to detect breast cancer, early and efficiently, to distinguish between benign and malignant diseases. Several techniques in image processing and analysis have been developed to address this problem. In this paper, we propose a new solution to the problem of computer aided detection and...
متن کاملArabidopsis leaf plasma membrane proteome using a gel free method: Focus on receptor–like kinases
The hydrophobic proteins of plant plasma membrane still remain largely unknown. For example in the Arabidopsis genome, receptor-like kinases (RLKs) are plasma membrane proteins, functioning as the primary receptors in the signaling of stress conditions, hormones and the presence of pathogens form a diverse family of over 610 genes. A limited number of these proteins have appeard in pr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 9 شماره
صفحات -
تاریخ انتشار 2014